Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 8971 |
| Missing cells | 21805 |
| Missing cells (%) | 14.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 124.0 B |
Variable types
| NUM | 15 |
|---|---|
| CAT | 2 |
day365 is highly correlated with day180 and 5 other fields | High correlation |
day180 is highly correlated with day365 and 5 other fields | High correlation |
day545 is highly correlated with day180 and 5 other fields | High correlation |
day730 is highly correlated with day180 and 5 other fields | High correlation |
day1095 is highly correlated with day180 and 5 other fields | High correlation |
day1460 is highly correlated with day180 and 5 other fields | High correlation |
day1825 is highly correlated with day180 and 5 other fields | High correlation |
day180 has 502 (5.6%) missing values | Missing |
day365 has 1054 (11.7%) missing values | Missing |
day545 has 1843 (20.5%) missing values | Missing |
day730 has 2703 (30.1%) missing values | Missing |
day1095 has 4153 (46.3%) missing values | Missing |
day1460 has 5141 (57.3%) missing values | Missing |
day1825 has 6317 (70.4%) missing values | Missing |
api has unique values | Unique |
hybrid_collect has 7370 (82.2%) zeros | Zeros |
slickwater_collect has 3046 (34.0%) zeros | Zeros |
gel_collect has 5405 (60.2%) zeros | Zeros |
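The missing-cell total in the dataset statistics can be reconciled with the per-variable missing counts quoted in this report. The sketch below is only an arithmetic check on those published figures; the names `missing_by_variable`, `total_missing` and `pct_missing` are illustrative and not part of the profiling output.

```python
# Per-variable missing counts as quoted in the warnings and variable sections.
# Variables not listed here (api, hybrid_collect, slickwater_collect,
# gel_collect, formation) are reported with 0 missing values.
missing_by_variable = {
    "State": 2,
    "TotalCleanVol": 43,
    "Latitude": 2,
    "Longitude": 2,
    "day180": 502,
    "day365": 1054,
    "day545": 1843,
    "day730": 2703,
    "day1095": 4153,
    "day1460": 5141,
    "day1825": 6317,
    "TotalProppant": 43,
}

n_variables = 17
n_observations = 8971

total_missing = sum(missing_by_variable.values())
total_cells = n_variables * n_observations
pct_missing = 100 * total_missing / total_cells

print(total_missing)          # 21805
print(round(pct_missing, 1))  # 14.3
```

Most of the missing cells come from the seven day columns, and the count grows with the time horizon.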
Reproduction
| Analysis started | 2020-11-30 18:44:17.532539 |
|---|---|
| Analysis finished | 2020-11-30 18:45:03.652310 |
| Duration | 46.12 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
api
| Distinct | 8971 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.223603369e+12 |
|---|---|
| Minimum | 5.00109742e+12 |
| Maximum | 4.903120131e+13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 5.00109742e+12 |
|---|---|
| 5-th percentile | 5.123330865e+12 |
| Q1 | 5.123376475e+12 |
| median | 5.12340919e+12 |
| Q3 | 5.123448245e+12 |
| 95-th percentile | 5.12350136e+12 |
| Maximum | 4.903120131e+13 |
| Range | 4.403010389e+13 |
| Interquartile range (IQR) | 71770000 |
Descriptive statistics
| Standard deviation | 9.378947041e+12 |
|---|---|
| Coefficient of variation (CV) | 1.298375141 |
| Kurtosis | 15.92249159 |
| Mean | 7.223603369e+12 |
| Median Absolute Deviation (MAD) | 35460000 |
| Skewness | 4.233064178 |
| Sum | 6.480294582e+16 |
| Variance | 8.796464759e+25 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5.12343481e+12 | 1 | < 0.1% | |
| 5.12339353e+12 | 1 | < 0.1% | |
| 5.12345533e+12 | 1 | < 0.1% | |
| 5.1234095e+12 | 1 | < 0.1% | |
| 5.12348042e+12 | 1 | < 0.1% | |
| 5.12343002e+12 | 1 | < 0.1% | |
| 5.12336899e+12 | 1 | < 0.1% | |
| 5.12333424e+12 | 1 | < 0.1% | |
| 5.12337063e+12 | 1 | < 0.1% | |
| 5.12339236e+12 | 1 | < 0.1% | |
| Other values (8961) | 8961 | 99.9% |
| Value | Count | Frequency (%) | |
| 5.00109742e+12 | 1 | < 0.1% | |
| 5.00109753e+12 | 1 | < 0.1% | |
| 5.00109754e+12 | 1 | < 0.1% | |
| 5.0010976e+12 | 1 | < 0.1% | |
| 5.00109772e+12 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4.903120131e+13 | 1 | < 0.1% | |
| 4.903120064e+13 | 1 | < 0.1% | |
| 4.902127986e+13 | 1 | < 0.1% | |
| 4.902127869e+13 | 1 | < 0.1% | |
| 4.902127814e+13 | 1 | < 0.1% |
State
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 70.1 KiB |
| Value | Count | Frequency (%) | |
| COLORADO | 8539 | 95.2% | |
| WYOMING | 430 | 4.8% | |
| (Missing) | 2 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.950953071 |
| Min length | 3 |
TotalCleanVol
Real number (ℝ≥0)
| Distinct | 8723 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 43 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 164681.1792 |
|---|---|
| Minimum | 2814 |
| Maximum | 1330817 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 2814 |
|---|---|
| 5-th percentile | 50219.15 |
| Q1 | 77918.5 |
| median | 133340 |
| Q3 | 202932.5 |
| 95-th percentile | 416048.1 |
| Maximum | 1330817 |
| Range | 1328003 |
| Interquartile range (IQR) | 125014 |
Descriptive statistics
| Standard deviation | 121781.9819 |
|---|---|
| Coefficient of variation (CV) | 0.7395015175 |
| Kurtosis | 7.430473158 |
| Mean | 164681.1792 |
| Median Absolute Deviation (MAD) | 59481.5 |
| Skewness | 2.170306418 |
| Sum | 1470273568 |
| Variance | 1.483085112e+10 |
| Monotonicity | Not monotonic |
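Several rows in the quantile and descriptive tables are derived from other rows. As a hedged sanity check, the sketch below recomputes the range, IQR, coefficient of variation, variance and sum for TotalCleanVol from the figures quoted above. The variable names are illustrative, and the assumption that the mean and sum cover only non-missing observations is inferred from the numbers rather than stated in the report.

```python
import math

# Figures copied from the TotalCleanVol section of the report.
minimum, maximum = 2814, 1330817
q1, q3 = 77918.5, 202932.5
mean, std = 164681.1792, 121781.9819
n_non_missing = 8971 - 43  # observations minus missing values (assumption)

value_range = maximum - minimum   # "Range"
iqr = q3 - q1                     # "Interquartile range (IQR)"
cv = std / mean                   # "Coefficient of variation (CV)"
variance = std ** 2               # "Variance"
total = mean * n_non_missing      # "Sum", assuming missing values are skipped

# Compare against the values printed in the report.
assert value_range == 1328003
assert iqr == 125014
assert math.isclose(cv, 0.7395015175, rel_tol=1e-6)
assert math.isclose(variance, 1.483085112e10, rel_tol=1e-6)
assert math.isclose(total, 1470273568, rel_tol=1e-6)
```

The same relationships can be checked for any other numeric variable in the report by swapping in its figures.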
| Value | Count | Frequency (%) | |
| 59407 | 32 | 0.4% | |
| 125376 | 3 | < 0.1% | |
| 68894 | 3 | < 0.1% | |
| 95480 | 3 | < 0.1% | |
| 171888 | 2 | < 0.1% | |
| 121410 | 2 | < 0.1% | |
| 151130 | 2 | < 0.1% | |
| 36762 | 2 | < 0.1% | |
| 63958 | 2 | < 0.1% | |
| 139136 | 2 | < 0.1% | |
| Other values (8713) | 8875 | 98.9% | |
| (Missing) | 43 | 0.5% |
| Value | Count | Frequency (%) | |
| 2814 | 1 | < 0.1% | |
| 2826 | 1 | < 0.1% | |
| 4125 | 1 | < 0.1% | |
| 6506 | 1 | < 0.1% | |
| 6545 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1330817 | 1 | < 0.1% | |
| 1271866 | 1 | < 0.1% | |
| 1118923 | 1 | < 0.1% | |
| 1079361 | 1 | < 0.1% | |
| 1057276 | 1 | < 0.1% |
hybrid_collect
| Distinct | 1585 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20177.83135 |
|---|---|
| Minimum | 0 |
| Maximum | 879481 |
| Zeros | 7370 |
| Zeros (%) | 82.2% |
| Memory size | 35.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 139783 |
| Maximum | 879481 |
| Range | 879481 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 58276.72611 |
|---|---|
| Coefficient of variation (CV) | 2.888156072 |
| Kurtosis | 36.76131337 |
| Mean | 20177.83135 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.817451125 |
| Sum | 181015325 |
| Variance | 3396176807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 7370 | 82.2% | |
| 68894 | 3 | < 0.1% | |
| 28362 | 2 | < 0.1% | |
| 112404 | 2 | < 0.1% | |
| 82227 | 2 | < 0.1% | |
| 255192 | 2 | < 0.1% | |
| 108908 | 2 | < 0.1% | |
| 81285 | 2 | < 0.1% | |
| 66171 | 2 | < 0.1% | |
| 43923 | 2 | < 0.1% | |
| Other values (1575) | 1582 | 17.6% |
| Value | Count | Frequency (%) | |
| 0 | 7370 | 82.2% | |
| 89 | 1 | < 0.1% | |
| 190 | 1 | < 0.1% | |
| 217 | 1 | < 0.1% | |
| 259 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 879481 | 1 | < 0.1% | |
| 871982 | 1 | < 0.1% | |
| 740502 | 1 | < 0.1% | |
| 738313 | 1 | < 0.1% | |
| 714587 | 1 | < 0.1% |
slickwater_collect
| Distinct | 5824 |
|---|---|
| Distinct (%) | 64.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98162.91941 |
|---|---|
| Minimum | 0 |
| Maximum | 1329061 |
| Zeros | 3046 |
| Zeros (%) | 34.0% |
| Memory size | 35.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 52145 |
| Q3 | 146825.5 |
| 95-th percentile | 379120 |
| Maximum | 1329061 |
| Range | 1329061 |
| Interquartile range (IQR) | 146825.5 |
Descriptive statistics
| Standard deviation | 131860.8044 |
|---|---|
| Coefficient of variation (CV) | 1.343285277 |
| Kurtosis | 6.296602213 |
| Mean | 98162.91941 |
| Median Absolute Deviation (MAD) | 52145 |
| Skewness | 2.124408238 |
| Sum | 880619550 |
| Variance | 1.738727173e+10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 3046 | 34.0% | |
| 11127 | 31 | 0.3% | |
| 46026 | 3 | < 0.1% | |
| 10666 | 3 | < 0.1% | |
| 72154 | 3 | < 0.1% | |
| 126358 | 2 | < 0.1% | |
| 242116 | 2 | < 0.1% | |
| 83325 | 2 | < 0.1% | |
| 38096 | 2 | < 0.1% | |
| 108542 | 2 | < 0.1% | |
| Other values (5814) | 5875 | 65.5% |
| Value | Count | Frequency (%) | |
| 0 | 3046 | 34.0% | |
| 71 | 1 | < 0.1% | |
| 77 | 1 | < 0.1% | |
| 97 | 2 | < 0.1% | |
| 101 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1329061 | 1 | < 0.1% | |
| 1270116 | 1 | < 0.1% | |
| 1117412 | 1 | < 0.1% | |
| 977225 | 1 | < 0.1% | |
| 975410 | 1 | < 0.1% |
gel_collect
| Distinct | 3468 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33540.37588 |
|---|---|
| Minimum | 0 |
| Maximum | 645479 |
| Zeros | 5405 |
| Zeros (%) | 60.2% |
| Memory size | 35.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 51801 |
| 95-th percentile | 169909.5 |
| Maximum | 645479 |
| Range | 645479 |
| Interquartile range (IQR) | 51801 |
Descriptive statistics
| Standard deviation | 64164.47765 |
|---|---|
| Coefficient of variation (CV) | 1.913051836 |
| Kurtosis | 12.3814224 |
| Mean | 33540.37588 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.011582018 |
| Sum | 300890712 |
| Variance | 4117080192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5405 | 60.2% | |
| 48280 | 29 | 0.3% | |
| 14492 | 3 | < 0.1% | |
| 108079 | 2 | < 0.1% | |
| 2811 | 2 | < 0.1% | |
| 64521 | 2 | < 0.1% | |
| 74425 | 2 | < 0.1% | |
| 67136 | 2 | < 0.1% | |
| 72334 | 2 | < 0.1% | |
| 52791 | 2 | < 0.1% | |
| Other values (3458) | 3520 | 39.2% |
| Value | Count | Frequency (%) | |
| 0 | 5405 | 60.2% | |
| 24 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 35 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 645479 | 1 | < 0.1% | |
| 635955 | 1 | < 0.1% | |
| 602949 | 1 | < 0.1% | |
| 558041 | 1 | < 0.1% | |
| 556810 | 1 | < 0.1% |
Latitude
Real number (ℝ≥0)
| Distinct | 8011 |
|---|---|
| Distinct (%) | 89.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.37661924 |
|---|---|
| Minimum | 39.6080407 |
| Maximum | 42.104219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 39.6080407 |
|---|---|
| 5-th percentile | 40.01669581 |
| Q1 | 40.1469952 |
| median | 40.35947564 |
| Q3 | 40.49559932 |
| 95-th percentile | 40.9739586 |
| Maximum | 42.104219 |
| Range | 2.4961783 |
| Interquartile range (IQR) | 0.34860412 |
Descriptive statistics
| Standard deviation | 0.3022016851 |
|---|---|
| Coefficient of variation (CV) | 0.007484571289 |
| Kurtosis | 1.989136556 |
| Mean | 40.37661924 |
| Median Absolute Deviation (MAD) | 0.17695407 |
| Skewness | 1.077587371 |
| Sum | 362137.898 |
| Variance | 0.0913258585 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 40.4646113 | 12 | 0.1% | |
| 39.998672 | 12 | 0.1% | |
| 40.20281 | 11 | 0.1% | |
| 40.17203 | 10 | 0.1% | |
| 40.13205959 | 9 | 0.1% | |
| 40.43883 | 9 | 0.1% | |
| 40.37017 | 8 | 0.1% | |
| 40.07354743 | 8 | 0.1% | |
| 40.012206 | 8 | 0.1% | |
| 40.259639 | 8 | 0.1% | |
| Other values (8001) | 8874 | 98.9% |
| Value | Count | Frequency (%) | |
| 39.6080407 | 1 | < 0.1% | |
| 39.6129 | 1 | < 0.1% | |
| 39.613011 | 1 | < 0.1% | |
| 39.6142147 | 1 | < 0.1% | |
| 39.61454469 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 42.104219 | 1 | < 0.1% | |
| 42.059405 | 1 | < 0.1% | |
| 42.045853 | 1 | < 0.1% | |
| 42.032108 | 1 | < 0.1% | |
| 42.003112 | 1 | < 0.1% |
Longitude
Real number (ℝ)
| Distinct | 8187 |
|---|---|
| Distinct (%) | 91.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -104.6121814 |
|---|---|
| Minimum | -105.9919448 |
| Maximum | -103.7248046 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | -105.9919448 |
|---|---|
| 5-th percentile | -104.9969189 |
| Q1 | -104.8410646 |
| median | -104.671662 |
| Q3 | -104.44798 |
| 95-th percentile | -103.8842004 |
| Maximum | -103.7248046 |
| Range | 2.2671402 |
| Interquartile range (IQR) | 0.3930846 |
Descriptive statistics
| Standard deviation | 0.3062217423 |
|---|---|
| Coefficient of variation (CV) | -0.002927209223 |
| Kurtosis | 0.55704051 |
| Mean | -104.6121814 |
| Median Absolute Deviation (MAD) | 0.189027 |
| Skewness | 0.9753312732 |
| Sum | -938266.6552 |
| Variance | 0.09377175546 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| -104.7667825 | 12 | 0.1% | |
| -104.957139 | 12 | 0.1% | |
| -104.926457 | 12 | 0.1% | |
| -104.58137 | 11 | 0.1% | |
| -104.6668 | 11 | 0.1% | |
| -104.53751 | 10 | 0.1% | |
| -104.7738264 | 10 | 0.1% | |
| -104.60033 | 10 | 0.1% | |
| -104.933139 | 10 | 0.1% | |
| -104.6291022 | 9 | 0.1% | |
| Other values (8177) | 8862 | 98.8% |
| Value | Count | Frequency (%) | |
| -105.9919448 | 1 | < 0.1% | |
| -105.9359024 | 1 | < 0.1% | |
| -105.054117 | 1 | < 0.1% | |
| -105.0538534 | 1 | < 0.1% | |
| -105.0537528 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -103.7248046 | 1 | < 0.1% | |
| -103.734464 | 1 | < 0.1% | |
| -103.74255 | 1 | < 0.1% | |
| -103.7539064 | 1 | < 0.1% | |
| -103.7614002 | 1 | < 0.1% |
formation
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.1 KiB |
| Value | Count | Frequency (%) | |
| NIOBRARA | 6835 | 76.2% | |
| CODELL | 2124 | 23.7% | |
| GREENHORN | 10 | 0.1% | |
| SUSSEX | 2 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.527143016 |
| Min length | 6 |
day180
| Distinct | 8036 |
|---|---|
| Distinct (%) | 94.9% |
| Missing | 502 |
| Missing (%) | 5.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47451.94568 |
|---|---|
| Minimum | 408 |
| Maximum | 216956 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 408 |
|---|---|
| 5-th percentile | 15730.4 |
| Q1 | 29472 |
| median | 42702 |
| Q3 | 61138 |
| 95-th percentile | 94037 |
| Maximum | 216956 |
| Range | 216548 |
| Interquartile range (IQR) | 31666 |
Descriptive statistics
| Standard deviation | 24837.41903 |
|---|---|
| Coefficient of variation (CV) | 0.5234225629 |
| Kurtosis | 1.76366997 |
| Mean | 47451.94568 |
| Median Absolute Deviation (MAD) | 15083 |
| Skewness | 1.067613116 |
| Sum | 401870528 |
| Variance | 616897383.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 48690 | 4 | < 0.1% | |
| 38685 | 4 | < 0.1% | |
| 38998 | 3 | < 0.1% | |
| 45767 | 3 | < 0.1% | |
| 48821 | 3 | < 0.1% | |
| 32262 | 3 | < 0.1% | |
| 38205 | 3 | < 0.1% | |
| 30735 | 3 | < 0.1% | |
| 50356 | 3 | < 0.1% | |
| 44254 | 3 | < 0.1% | |
| Other values (8026) | 8437 | 94.0% | |
| (Missing) | 502 | 5.6% |
| Value | Count | Frequency (%) | |
| 408 | 1 | < 0.1% | |
| 449 | 1 | < 0.1% | |
| 453 | 1 | < 0.1% | |
| 567 | 1 | < 0.1% | |
| 606 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 216956 | 1 | < 0.1% | |
| 199184 | 1 | < 0.1% | |
| 170137 | 1 | < 0.1% | |
| 168677 | 1 | < 0.1% | |
| 168211 | 1 | < 0.1% |
day365
| Distinct | 7660 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 1054 |
| Missing (%) | 11.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70122.33649 |
|---|---|
| Minimum | 215 |
| Maximum | 414735 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 215 |
|---|---|
| 5-th percentile | 22642.6 |
| Q1 | 41944 |
| median | 61445 |
| Q3 | 90494 |
| 95-th percentile | 144093.6 |
| Maximum | 414735 |
| Range | 414520 |
| Interquartile range (IQR) | 48550 |
Descriptive statistics
| Standard deviation | 39442.4881 |
|---|---|
| Coefficient of variation (CV) | 0.5624810876 |
| Kurtosis | 3.287234184 |
| Mean | 70122.33649 |
| Median Absolute Deviation (MAD) | 22460 |
| Skewness | 1.350948135 |
| Sum | 555158538 |
| Variance | 1555709867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 53179 | 3 | < 0.1% | |
| 57306 | 3 | < 0.1% | |
| 47594 | 3 | < 0.1% | |
| 47780 | 3 | < 0.1% | |
| 85022 | 3 | < 0.1% | |
| 70381 | 3 | < 0.1% | |
| 24571 | 2 | < 0.1% | |
| 48622 | 2 | < 0.1% | |
| 38325 | 2 | < 0.1% | |
| 18608 | 2 | < 0.1% | |
| Other values (7650) | 7891 | 88.0% | |
| (Missing) | 1054 | 11.7% |
| Value | Count | Frequency (%) | |
| 215 | 1 | < 0.1% | |
| 858 | 1 | < 0.1% | |
| 1697 | 1 | < 0.1% | |
| 1856 | 1 | < 0.1% | |
| 1903 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 414735 | 1 | < 0.1% | |
| 354975 | 1 | < 0.1% | |
| 299194 | 1 | < 0.1% | |
| 293085 | 1 | < 0.1% | |
| 292043 | 1 | < 0.1% |
day545
| Distinct | 6934 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 1843 |
| Missing (%) | 20.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81492.62767 |
|---|---|
| Minimum | 215 |
| Maximum | 509699 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 215 |
|---|---|
| 5-th percentile | 26688.15 |
| Q1 | 48398.25 |
| median | 70089 |
| Q3 | 104418 |
| 95-th percentile | 171842.7 |
| Maximum | 509699 |
| Range | 509484 |
| Interquartile range (IQR) | 56019.75 |
Descriptive statistics
| Standard deviation | 47229.21373 |
|---|---|
| Coefficient of variation (CV) | 0.5795519802 |
| Kurtosis | 4.375914711 |
| Mean | 81492.62767 |
| Median Absolute Deviation (MAD) | 25589 |
| Skewness | 1.533765439 |
| Sum | 580879450 |
| Variance | 2230598630 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 51885 | 3 | < 0.1% | |
| 46099 | 3 | < 0.1% | |
| 35477 | 3 | < 0.1% | |
| 55479 | 3 | < 0.1% | |
| 87859 | 3 | < 0.1% | |
| 132697 | 2 | < 0.1% | |
| 77599 | 2 | < 0.1% | |
| 33429 | 2 | < 0.1% | |
| 69832 | 2 | < 0.1% | |
| 89134 | 2 | < 0.1% | |
| Other values (6924) | 7103 | 79.2% | |
| (Missing) | 1843 | 20.5% |
| Value | Count | Frequency (%) | |
| 215 | 1 | < 0.1% | |
| 2126 | 1 | < 0.1% | |
| 2544 | 1 | < 0.1% | |
| 3003 | 1 | < 0.1% | |
| 3354 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 509699 | 1 | < 0.1% | |
| 460222 | 1 | < 0.1% | |
| 415193 | 1 | < 0.1% | |
| 364908 | 1 | < 0.1% | |
| 364223 | 1 | < 0.1% |
day730
| Distinct | 6129 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 2703 |
| Missing (%) | 30.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87848.79627 |
|---|---|
| Minimum | 215 |
| Maximum | 526188 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 215 |
|---|---|
| 5-th percentile | 29242.95 |
| Q1 | 52687 |
| median | 74740.5 |
| Q3 | 111189 |
| 95-th percentile | 188100.25 |
| Maximum | 526188 |
| Range | 525973 |
| Interquartile range (IQR) | 58502 |
Descriptive statistics
| Standard deviation | 51899.0393 |
|---|---|
| Coefficient of variation (CV) | 0.5907768974 |
| Kurtosis | 4.770352164 |
| Mean | 87848.79627 |
| Median Absolute Deviation (MAD) | 26933 |
| Skewness | 1.669444134 |
| Sum | 550636255 |
| Variance | 2693510280 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 33195 | 3 | < 0.1% | |
| 52716 | 3 | < 0.1% | |
| 71080 | 3 | < 0.1% | |
| 64658 | 2 | < 0.1% | |
| 58416 | 2 | < 0.1% | |
| 82867 | 2 | < 0.1% | |
| 74574 | 2 | < 0.1% | |
| 59841 | 2 | < 0.1% | |
| 88740 | 2 | < 0.1% | |
| 87808 | 2 | < 0.1% | |
| Other values (6119) | 6245 | 69.6% | |
| (Missing) | 2703 | 30.1% |
| Value | Count | Frequency (%) | |
| 215 | 1 | < 0.1% | |
| 2486 | 1 | < 0.1% | |
| 3444 | 1 | < 0.1% | |
| 3587 | 1 | < 0.1% | |
| 3875 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 526188 | 1 | < 0.1% | |
| 466572 | 1 | < 0.1% | |
| 415841 | 1 | < 0.1% | |
| 405327 | 1 | < 0.1% | |
| 404653 | 1 | < 0.1% |
day1095
| Distinct | 4735 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 4153 |
| Missing (%) | 46.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91696.94105 |
|---|---|
| Minimum | 5045 |
| Maximum | 434958 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 5045 |
|---|---|
| 5-th percentile | 32723.1 |
| Q1 | 57481.75 |
| median | 79843 |
| Q3 | 113526 |
| 95-th percentile | 193168.4 |
| Maximum | 434958 |
| Range | 429913 |
| Interquartile range (IQR) | 56044.25 |
Descriptive statistics
| Standard deviation | 51004.63675 |
|---|---|
| Coefficient of variation (CV) | 0.556230515 |
| Kurtosis | 3.626803989 |
| Mean | 91696.94105 |
| Median Absolute Deviation (MAD) | 26325 |
| Skewness | 1.525378144 |
| Sum | 441795862 |
| Variance | 2601472970 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 74339 | 2 | < 0.1% | |
| 38667 | 2 | < 0.1% | |
| 45209 | 2 | < 0.1% | |
| 64094 | 2 | < 0.1% | |
| 77385 | 2 | < 0.1% | |
| 104340 | 2 | < 0.1% | |
| 62268 | 2 | < 0.1% | |
| 57794 | 2 | < 0.1% | |
| 52253 | 2 | < 0.1% | |
| 84085 | 2 | < 0.1% | |
| Other values (4725) | 4798 | 53.5% | |
| (Missing) | 4153 | 46.3% |
| Value | Count | Frequency (%) | |
| 5045 | 1 | < 0.1% | |
| 5253 | 1 | < 0.1% | |
| 6695 | 1 | < 0.1% | |
| 6972 | 1 | < 0.1% | |
| 7857 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 434958 | 1 | < 0.1% | |
| 418096 | 1 | < 0.1% | |
| 414870 | 1 | < 0.1% | |
| 396570 | 1 | < 0.1% | |
| 371447 | 1 | < 0.1% |
day1460
| Distinct | 3770 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 5141 |
| Missing (%) | 57.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93808.23133 |
|---|---|
| Minimum | 6620 |
| Maximum | 372235 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 6620 |
|---|---|
| 5-th percentile | 36088.75 |
| Q1 | 60799 |
| median | 82680 |
| Q3 | 114853.25 |
| 95-th percentile | 191336.55 |
| Maximum | 372235 |
| Range | 365615 |
| Interquartile range (IQR) | 54054.25 |
Descriptive statistics
| Standard deviation | 49532.09836 |
|---|---|
| Coefficient of variation (CV) | 0.5280144147 |
| Kurtosis | 3.104722613 |
| Mean | 93808.23133 |
| Median Absolute Deviation (MAD) | 25466 |
| Skewness | 1.480056797 |
| Sum | 359285526 |
| Variance | 2453428768 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 32459 | 2 | < 0.1% | |
| 81759 | 2 | < 0.1% | |
| 94621 | 2 | < 0.1% | |
| 80373 | 2 | < 0.1% | |
| 77071 | 2 | < 0.1% | |
| 105690 | 2 | < 0.1% | |
| 54225 | 2 | < 0.1% | |
| 60023 | 2 | < 0.1% | |
| 79714 | 2 | < 0.1% | |
| 71390 | 2 | < 0.1% | |
| Other values (3760) | 3810 | 42.5% | |
| (Missing) | 5141 | 57.3% |
| Value | Count | Frequency (%) | |
| 6620 | 1 | < 0.1% | |
| 7538 | 1 | < 0.1% | |
| 8292 | 1 | < 0.1% | |
| 9015 | 1 | < 0.1% | |
| 9125 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 372235 | 1 | < 0.1% | |
| 370622 | 1 | < 0.1% | |
| 343353 | 1 | < 0.1% | |
| 341103 | 1 | < 0.1% | |
| 338679 | 1 | < 0.1% |
day1825
| Distinct | 2632 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 6317 |
| Missing (%) | 70.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96202.01959 |
|---|---|
| Minimum | 9278 |
| Maximum | 400751 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 9278 |
|---|---|
| 5-th percentile | 38378.95 |
| Q1 | 63796.25 |
| median | 86448 |
| Q3 | 116765.75 |
| 95-th percentile | 189775.75 |
| Maximum | 400751 |
| Range | 391473 |
| Interquartile range (IQR) | 52969.5 |
Descriptive statistics
| Standard deviation | 48241.85126 |
|---|---|
| Coefficient of variation (CV) | 0.5014640177 |
| Kurtosis | 3.363213644 |
| Mean | 96202.01959 |
| Median Absolute Deviation (MAD) | 25600.5 |
| Skewness | 1.452819404 |
| Sum | 255320160 |
| Variance | 2327276213 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 71764 | 2 | < 0.1% | |
| 89700 | 2 | < 0.1% | |
| 95254 | 2 | < 0.1% | |
| 44422 | 2 | < 0.1% | |
| 84317 | 2 | < 0.1% | |
| 164906 | 2 | < 0.1% | |
| 68586 | 2 | < 0.1% | |
| 69978 | 2 | < 0.1% | |
| 105282 | 2 | < 0.1% | |
| 84188 | 2 | < 0.1% | |
| Other values (2622) | 2634 | 29.4% | |
| (Missing) | 6317 | 70.4% |
| Value | Count | Frequency (%) | |
| 9278 | 1 | < 0.1% | |
| 9614 | 1 | < 0.1% | |
| 9654 | 1 | < 0.1% | |
| 10538 | 1 | < 0.1% | |
| 11749 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 400751 | 1 | < 0.1% | |
| 351182 | 1 | < 0.1% | |
| 347015 | 1 | < 0.1% | |
| 341354 | 1 | < 0.1% | |
| 333194 | 1 | < 0.1% |
TotalProppant
Real number (ℝ≥0)
| Distinct | 8723 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 43 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6863428.374 |
|---|---|
| Minimum | 10833 |
| Maximum | 39844907 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 70.1 KiB |
Quantile statistics
| Minimum | 10833 |
|---|---|
| 5-th percentile | 1849439.2 |
| Q1 | 3641288.5 |
| median | 5101441.5 |
| Q3 | 8842257.5 |
| 95-th percentile | 17497438.15 |
| Maximum | 39844907 |
| Range | 39834074 |
| Interquartile range (IQR) | 5200969 |
Descriptive statistics
| Standard deviation | 4954765.145 |
|---|---|
| Coefficient of variation (CV) | 0.7219081886 |
| Kurtosis | 4.431899216 |
| Mean | 6863428.374 |
| Median Absolute Deviation (MAD) | 2207689.5 |
| Skewness | 1.822383067 |
| Sum | 6.127668853e+10 |
| Variance | 2.454969765e+13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3679086 | 23 | 0.3% | |
| 3750000 | 17 | 0.2% | |
| 3678540 | 11 | 0.1% | |
| 5025000 | 5 | 0.1% | |
| 6020000 | 5 | 0.1% | |
| 4400000 | 5 | 0.1% | |
| 5500000 | 5 | 0.1% | |
| 8397000 | 4 | < 0.1% | |
| 4718000 | 4 | < 0.1% | |
| 16848000 | 4 | < 0.1% | |
| Other values (8713) | 8845 | 98.6% | |
| (Missing) | 43 | 0.5% |
| Value | Count | Frequency (%) | |
| 10833 | 1 | < 0.1% | |
| 86977 | 1 | < 0.1% | |
| 97015 | 1 | < 0.1% | |
| 126750 | 1 | < 0.1% | |
| 129871 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 39844907 | 1 | < 0.1% | |
| 39469757 | 1 | < 0.1% | |
| 38376520 | 1 | < 0.1% | |
| 38275020 | 1 | < 0.1% | |
| 35710970 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
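As a minimal sketch of the calculation described above, the function below divides the covariance by the product of the standard deviations (the normalising factors cancel, so raw sums are used). The function name `pearson_r` is illustrative, and the sample values are the first five `day180` and `day365` entries from the "First rows" table; this is not the implementation used to produce the report.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n-1) factors of covariance and variances cancel in the ratio.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # Raises ZeroDivisionError if either input is constant (r is undefined).
    return cov / math.sqrt(var_x * var_y)


# First five day180 / day365 values from the "First rows" table.
day180 = [34629.0, 12174.0, 44756.0, 39490.0, 60833.0]
day365 = [45574.0, 15003.0, 65665.0, 55857.0, 88075.0]
print(pearson_r(day180, day365))
```

Shifting or rescaling either input by a positive factor leaves the result unchanged, which is the invariance property mentioned in the text.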
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
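The rank-based calculation can be sketched as Pearson's r applied to ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative, and giving tied values the average of their ranks is an assumption (a common convention), not something the report specifies.

```python
import math


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: rho is 1 while r is below 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```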
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
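The pair-counting definition above can be sketched directly. The function name `kendall_tau` is illustrative. This follows the formula exactly as worded in the text (often called tau-a); library implementations frequently use a tie-adjusted variant (tau-b), so results may differ from the report's heatmap when ties are present.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, i.e. tau-a."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # sign == 0 means a tie in x or y; it counts toward neither group.
    return (concordant - discordant) / len(pairs)


# One swapped pair out of six: 5 concordant, 1 discordant.
print(kendall_tau([1, 2, 3, 4], [1, 3, 2, 4]))
```

The double loop over pairs is quadratic in the number of observations, which is fine for illustration but slow for a dataset of this report's size.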
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | api | State | TotalCleanVol | hybrid_collect | slickwater_collect | gel_collect | Latitude | Longitude | formation | day180 | day365 | day545 | day730 | day1095 | day1460 | day1825 | TotalProppant |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5123450470000 | COLORADO | 97227.0 | 0 | 91186 | 0 | 40.121508 | -104.902747 | NIOBRARA | 34629.0 | 45574.0 | 52115.0 | 56460.0 | NaN | NaN | NaN | 5720290.0 |
| 1 | 5123320810100 | COLORADO | 17045.0 | 0 | 2173 | 9574 | 40.930917 | -104.467586 | NIOBRARA | 12174.0 | 15003.0 | 16390.0 | 17777.0 | 19082.0 | 20067.0 | 20926.0 | 577200.0 |
| 2 | 5123403830000 | COLORADO | 146513.0 | 0 | 64588 | 0 | 40.369187 | -104.524494 | NIOBRARA | 44756.0 | 65665.0 | 79842.0 | 89154.0 | 103961.0 | NaN | NaN | 8452600.0 |
| 3 | 5123376830000 | COLORADO | 102024.0 | 0 | 102024 | 0 | 40.010014 | -104.791563 | CODELL | 39490.0 | 55857.0 | 67039.0 | 76653.0 | 90541.0 | 100144.0 | 107214.0 | 3406480.0 |
| 4 | 5123360060000 | COLORADO | 184744.0 | 0 | 160288 | 21992 | 40.073949 | -104.710424 | NIOBRARA | 60833.0 | 88075.0 | 105605.0 | 119686.0 | 135440.0 | 146147.0 | 154308.0 | 5624030.0 |
| 5 | 5123377060000 | COLORADO | 65210.0 | 0 | 65210 | 0 | 40.509264 | -104.779784 | NIOBRARA | 10544.0 | 18756.0 | 22896.0 | 24975.0 | 28069.0 | 30241.0 | NaN | 4195880.0 |
| 6 | 5123377340000 | COLORADO | 65892.0 | 0 | 22832 | 43060 | 40.509263 | -104.779874 | NIOBRARA | 11366.0 | 16721.0 | 19514.0 | 23535.0 | 29542.0 | 33470.0 | NaN | 4256820.0 |
| 7 | 5123440840000 | COLORADO | 127878.0 | 0 | 31866 | 94333 | 40.365409 | -104.629102 | NIOBRARA | 57339.0 | 81471.0 | NaN | NaN | NaN | NaN | NaN | 7262000.0 |
| 8 | 5123453750000 | COLORADO | 366038.0 | 0 | 365236 | 0 | 40.151840 | -104.534823 | NIOBRARA | 87895.0 | 146091.0 | 180770.0 | NaN | NaN | NaN | NaN | 10929589.0 |
| 9 | 5123372760000 | COLORADO | 79320.0 | 0 | 42570 | 36000 | 40.518015 | -104.720354 | CODELL | 32284.0 | 45911.0 | 54622.0 | 60890.0 | 69639.0 | 76069.0 | 82387.0 | 3526080.0 |
Last rows
| | api | State | TotalCleanVol | hybrid_collect | slickwater_collect | gel_collect | Latitude | Longitude | formation | day180 | day365 | day545 | day730 | day1095 | day1460 | day1825 | TotalProppant |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8961 | 5123477070000 | COLORADO | 606572.0 | 0 | 587236 | 0 | 40.217228 | -104.587743 | NIOBRARA | 127355.0 | 189428.0 | NaN | NaN | NaN | NaN | NaN | 19189890.0 |
| 8962 | 5123416450000 | COLORADO | 183191.0 | 0 | 182881 | 0 | 40.853957 | -103.800187 | NIOBRARA | 38429.0 | 74895.0 | 89690.0 | 95603.0 | NaN | NaN | NaN | 4792428.0 |
| 8963 | 5123426110000 | COLORADO | 120302.0 | 0 | 117052 | 0 | 40.071857 | -104.777480 | CODELL | 69346.0 | 101079.0 | 118978.0 | 130420.0 | NaN | NaN | NaN | 3520900.0 |
| 8964 | 5123448650000 | COLORADO | 306220.0 | 0 | 288617 | 0 | 40.066461 | -104.965488 | NIOBRARA | 105012.0 | 164847.0 | 189603.0 | NaN | NaN | NaN | NaN | 13335390.0 |
| 8965 | 5123460330000 | COLORADO | 600917.0 | 0 | 600806 | 0 | 40.541986 | -104.759331 | NIOBRARA | 42116.0 | 76037.0 | NaN | NaN | NaN | NaN | NaN | 13186000.0 |
| 8966 | 49021210600000 | WYOMING | 113084.0 | 0 | 0 | 113084 | 41.101122 | -104.682676 | CODELL | 69240.0 | 80651.0 | 94600.0 | 108408.0 | 122126.0 | 128154.0 | 133519.0 | 10987347.0 |
| 8967 | 49021211170000 | WYOMING | 130089.0 | 0 | 60082 | 67906 | 41.298136 | -104.633688 | CODELL | 78702.0 | 120979.0 | 145865.0 | 157424.0 | 188097.0 | 215945.0 | 237755.0 | 6204780.0 |
| 8968 | 5001103760000 | COLORADO | 147619.0 | 0 | 0 | 147619 | 39.998672 | -104.851392 | NIOBRARA | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 6844000.0 |
| 8969 | 5123462860000 | COLORADO | 215601.0 | 0 | 0 | 215284 | 40.460545 | -104.766783 | NIOBRARA | 58227.0 | 87562.0 | NaN | NaN | NaN | NaN | NaN | 12501070.0 |
| 8970 | 5123410380000 | COLORADO | 148777.0 | 0 | 0 | 0 | 40.714772 | -104.035384 | NIOBRARA | 33952.0 | 51954.0 | 62651.0 | 70320.0 | 81318.0 | NaN | NaN | 5279549.0 |